CCACGCGTCCGCGCGGGAGCTGCTTGGAGGCTCGGCGGCCGGGAGGAGGCCGGGGCCACGCTTCTTGGAA 

GCTACTGAGTGACTTCTTTGAAGAACCATGAAGTCACACTATATTGTGCTAGCTCTAGCCTCCCTGACGO? 

TCCTGCTGTGTCTCCCCGTGTCCCAGAGCTGTAACAAAGCACTCTGTGCCAGCGATGTGAGCAAATGCCT 

CATTCAGGAGCO^CTGCCAGTGCCGGCCTGGAGAAGGGAACTGCCCCTGCTGTAAGGAGTGCATGCTGTGC 

CTCGGGGCCCTGTGGGACGAGTGCTGCGACTGO^GTCGGTATGTGCAACCCTCGGAATTACAGCGACACCC 

CGCCCACAa?CCAAGAGCACCGTGGAGGAGCTGCACGAGCCCATTCCGTCCCTGTOCAGGGCGCTGACGGA 

GGGCGACACCCAGCTGAACTGGAACATCGTCTCC'TTCCCTG'rGGCAGAGGAGCTGTCACACCATGAAAAC 

CTAGTCTCCTTCCTAGAAACTCTGAACC&GCTGCACCACCAAAACGTG1?CTGOTCCCAGC^ 

ACGCCCCCTTCCCCAGCGACAAAGAGCGCATGTGCACAGTGGTTTACTTTGATGACTGCATGTCCATCCA 

CCAGTGTAAGATATCCa^GCGAATCCATGGGTGCATCCAAGTAO'CGCTGGTTTCACAACGCCTGCTGCGAG 

TGCATCGGTCCAGAGTGCATa^GACTATGGGAGTAAAACTGTCAAGa'Ga?ATGAACTGCATGTTTTA^ 

GGGGAAGAAATGCAAACCAAAGCAAGAACGCTGACGATGCCGTTGACACTCCTACCTTGCAGGATGTCCC 

CACAGAGTCCACTGGACCAGC1?TGTACAAGGGTGAAACCTTTCTGCGTCACCTTOCTGCTTTTGCAGAGC 

CATGACAGCAGCTGCCTTACAGCGTGATTTAGCAGACAGACCTGACAGGTGTGTCATGTGACAGGCAGCC 

ATCAGGGTGGCCTAGCACATTOGl^GGG^rGC'TGA'irGTOGTAT'rGCACTGTCAAAGACTGACCAGTCCTGAT 

AGGGCTGGAAAGAGGAAAGGGGTGTGGGAAGGTGACTTTCTTAGCTCTCCCGTAGCAGCACACTGTCATA 

GTGTCTCAGTGTCATATTGCTGTGAAGAGACACCACAACCAAGGCAGCTTATGAAGAAAGCCCTTAATTA 

GGGCTCGCTTTACAGTTCCGAGGTTACTTCCTGGTCGTCGTGGTGGGGATGAGGCAGTAGGTAGCAGGCA 

TGGTGCTGGAGCAGTAGCTGAATGTTGGTTCACAAACTTAGCACCTCGAGTCATGATAGAAACACCATCC 

AGGCCTTGAAGCCCAACACGTAGCAACACCACTTGCCAAAGCAGGATCAAGGCTCTCTCAGCGTCCCAGG 

ATGCAGa?a?GGGCAAGTTCTTGAGTGAAAl'TCCTATACATACTAACTCTGCAATTTTGCTTCTATAGTTCC 

TCTTCTTGTTAACTATAI^GTCCAGAACCATTTCAGAATATGG 

GTTraTGTGAATAAAAAAAATAAAAAGGCCTCGTCTGCATAGATTGGGCAAGT 

CTAACAGTAAAGACTTTCTATTOGGCTTOAGGAGTATCCAAGGGGTGGTCTCTCGTGAGAGTACCTCGCA 
ACAGCAGCAGATGGTGTTGGAGGCTTGGCCTGCTGTGGGCT1?TTGAAACCTTAAAGTCCACCCCCAGTGA 
CACGCCTCCTCCAATAACa.CCACACCTCCTAATCCTOTCTAAGTAGTCCC^CAACTGGGAACCAAGC^ 
CAGATATACGAGCCCACAAAGGCCATTCTCGTTCAAACCACCACATGTAATAAAATATAO'GCCACGTCAA 
AAAAAAAAAAAAAAAAA (SEQ ID H0:1) 

MKSHYIVLALASLTFLLCLPVSQSCNKALCASDVSKCLIQELCQCRPGEGNCPCC 
KECMLCLGALWDECCDCVGMCNPRNYSDTPPTSKSTVEELHEPIPSLFRALTEG 
DTQLNWNIVSFPVAEELSHHENLVSFLETVNQLHHQNVSVPSNNVHAPFPSDKE 
RMCTVVYFDDCMSIHQCKISCESMGASKYRWFHNACCECIGPECIDYGSKTVKC 
MNCMF (SEQ ID N0:2) 



FIG.l 



red underlined = deleted in targeting construct 

green = sequence flanking Neo insert in targeting construct 



CCACGCGTCCGCGCGGGAGCTGCTTGGAGGCTCGGCGGCCGGGAGGAGGCCGGGGCCACG 

CTTCTTGGAAGCTACTGAGTGACTTCTTTGAAGAACCATGAAGTCACACTATATTGTGCT 

AGCTCTAGCCTCCCTGACGTTCCTGCTGTGTCTCCCCGTGTCCCAGAGCTGTAACAAAGC 

ACTCTGTGCCAGCGATGTGAGCAAATGCCTCATTCAGGAGCTCTGCCAGTGCCGGCCTGG 

AGAAGGGAACTGCCCCTGCTGTAAGGAGTGCATGCTGTGCCTCGGGGCCCTGTGGGACGA 

GTGCTGCGACTGTGTCGGTATGTGCAACCCTCGGAATTACAGCGACACCCCGCCCACATC 

CAAGAGCACCGTGGAGGAGCTGCACGAGCCCATTCCGTCCCTGTTCAGGGCGCTGACGGA 

GGGCGACACCCAGCTGAACTGGAACATCGTCTCCTTCCCTGTGGCAGAGGAGCTGTCACA 

CCATGAAAACCTAGTCTCCTTCCTAGAAACTGTGAACCAGCTGCACCACCAAAACGTGTC 

TGTTCCCAGCAACAATGTCCACGCCCCCTTCCCCAGCGACAAAG[AGCGCATGTGCACAGT 

GGTTTACTTTGATGACTGCATGTCCATCCACCAGTGTA 1 AGATATCCTGCGAATCCATGGG 

TGCATCCA AGTATCGCTGGTTTCACAACGCCTGCTGCGAGTGCATCG [ GTCC AGAGTGC AT 

TGACTATGGGAGTAAAACTGTCAAGTGTATGAACTGCATGTTTTAAAGAGGGGGAAGAAA 

TGCAAACCAAAGCAJAGAACGCTGACGATGCCGTTGACACTCCTACCTTGCAGGATGTCCC 

CACAGAGTCCACTGGACCAGCTTGTACAAGGGTGAAACCTTTCTGCGTCACCTTTCTGCT 

TTTGCAGAGCCATGACAGCAGCTGCCTTACAGCGTGATTTAGCAGACAGACCTGACAGGT 

GTGTCATGTGACAGGCAGCCATCAGGGTGGCCTAGCACATTTGTGGGTGCTGATGTTGTA 

TTGCACTGTCAAAGACTGACCAGTCCTGATAGGGCTGGAAAGAGGAAAGGGGTGTGGGAA 

GGTGACTTTCTTAGCTCTCCCGTAGCAGCACACTGTCATAGTGTCTCAGTGTCATATTGC 

TGTGAAGAGACACCACAACCAAGGCAGCTTATGAAGAAAGCCCTTAATTAGGGCTCGCTT 

TACAGTTCCGAGGTTACTTCCTGGTCGTCGTGGTGGGGATGAGGCAGTAGGTAGCAGGCA 

TGGTGCTGGAGCAGTAGCTGAATGTTGGTTCACAAACTTAGCACCTCGAGTCATGATAGA 

AACACCATCCAGGCCTTGAAGCCCAACACGTAGCAACACCACTTGCCAAAGCAGGATCAA 

GGCTCTGTCAGCGTCCCAGGATGCAGTTGGGCAAGTTCTTGAGTGAAATTCCTATACATA 

CTAACTCTGCAATTTTGCTTCTATAGTTCCTCTTCTTGTTAACTATATGTCCAGAACCAT 

TTCAGAATATGGGTTTTTGTGAATAAAAAAAATAAAAAGGCCTGGTCTGCATAGATTGGG 

CAAGTTCCCTTGCAGCCACAGGCTAACAGTAAAGACTTTCTATTTGGCTTTAGGAGTATC 

CAAGGGGTGGTCTCTCGTGAGAGTACCTCGCAACAGCAGCAGATGGTGTTGGAGGCTTGG 

CCTGCTGTGGGCTTTTGAAACCTT7VAAGTCCACCCCCAGTGACACGCCTCCTCCAATAAC 

ACCACACCTCCTAATCCTTTCTAAGTAGTCCCTCAACTGGGAACCAAGCATTCAGATATA 

CGAGCCCACAAAGGCCATTCTCGTTCAAACCACCACATGTAATAAAATATATGCCACGTC 
AAAAAAAAAAAAAAAAAAA (SEQ ID N0:1) 



FIG. 2A 



Gene Sequence 
Structure * 



639 bp 



Sequence Deleted 



707 bp 



Size of full-length 
cDNA: 1879 bp 




Targeting Vector* (genomic sequence) 
Construct Number: 203 



Arm Length: 



5* arm 



Neo 

Cassette 



3* arm 



5': 1.9 kb 
3": 3.6 kb 




Targeting Vector 
Endogenous Locus 



* Not drawn to scale 



5 ' >AACAAATGAAGATCTTTTGTC ' 
CCTCGTTTTTGTCCTGTGCTGATG 
AGCAGTAAGAGGGCTAGAAAGTAA : 
CTGCAGGTATTCTCTGAGCAAGCG 
AGCGAGTGGCCGAGCTCTTCCTGC , 
CTGTACTGAAATGTCCCTTTGCAT 
TTCAGAGCGCATGTGCACAGTGGT 
TTACTTTGATGACTGCATGTCCAT 
CCACCAGTGTAO ' (SEQ ID 
NO: 3) 



5 ■ >GTCCAGAGTGCATTGACTATG 
GGAGTAAAACTGTCAAGTGTATGA : 
ACTGCATGTTTTAAAGAGGGGGAA \ 
GAAATGCAAACCAAAGCAGTAAGT I 
CATGAAGTGTGCAGAAATCTTGGT 
TCTGGTATGCTAGGAGTGTGTTAA 
GTTATATGATTGTAACTGTGCTTT 
TTATATCTGGTGCCTATTAGTGTA 
GGTCTTTTCCA<3 ' (SEQ ID 
NO: 4) 



FIG. 2B 



